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Abstract 

o 

Given a discrete stochastic process, for example a chemical reaction system or a birth and death 
process, we often want to find a continuous stochastic approximation so that the techniques of stochastic 
differential equations may be brought to bear. One powerful and useful way to do this is the system size 
expansion of van Kampen to express a trajectory as a small stochastic perturbation to a deterministic 
trajectory, using a small parameter related to the volume of the system in question. This is usually 
pursued only up to first order, called the Linear Noise Approximation. The usual derivation of this 
proceeds via the master equation of the discrete process and derives a Fokker-Planck equation for the 
stochastic perturbation, both of which are equations for the evolution of probability distributions. Here 
we present a derivation using stochastic difference equations for the discrete process and leading, via the 
chemical Langevin equation of Gillespie, directly to a stochastic differential equation for the stochastic 
perturbation. The new derivation, which does not yield the full system size expansion, draws more 
, 1 * explicitly on the intuition of ordinary differential equations so may be more easily digestible for some 

audiences. 
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1 Introduction 

s 

^ Given a discrete stochastic process, for example a chemical reaction system or a birth and death process, 
we often want to find a continuous stochastic approximation so that the techniques of stochastic differential 
equations may be brought to bear. One powerful and useful way to do this is the system size expansion of 

i— —i Van Kampen [13], which has been proved useful in applications in biophysics [4], epidemiology jT], ecology 
|llj and neuroscience [21 E] . If the discrete process has a size or volume parameter $7 for which reaction rates 
j> scale appropriately, may be used as a small parameter. Inspired by the central limit theorem, we 

( ^ approximate the discrete random variable K as the sum of a continuous deterministic variable x, scaled with 
Q, and a smaller continuous stochastic perturbation £ scaled with in other words K = Qx + \/T&£. The 
chief application for this is to derive a linear noise approximation, meaning that the expansion is pursued 

^ to first order in in which the stochastic perturbation satisfies a Fokker-Planck equation with linear 

drift term and constant noise term, both calculated from the deterministic solution. The small parameter 
may be used to check the quality of the approximation, calculate higher-order corrections to moments, and 
• • so on, but this is almost never done. 

. J_h The usual method of deriving the system size expansion is 

X 

5_i 1. Express the discrete process in terms of its master equation. 



2. Make the ansatz K = £lx + yQ,^. 

3. Re-express the difference terms in the master equation in terms of differential operators in the new 
variables. 

4. Taylor expand the coefficients of the resulting equation about the deterministic term x. 



5. Collect terms to find a deterministic 0(vf2) equation for x and a Fokker-Planck equation for £ with 
linear terms of O(l), and terms of higher order in 1/vTL 
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While this is a perfectly rigorous way to derive the approximation, it involves a lot of notation, and can be 
a challenge to present to audiences who are unfamiliar with the master equation or Fokker-Planck equation. 
Many scientists and mathematicians are familiar with ordinary differential equations (ODEs), and with the 
idea of a stochastic differential equation (SDE) as an ODE with a random input, but are unused to thinking 
of evolution equations for probability distributions, and the extra steps involved in explaining such evolution 
equations may prove an obstacle to the success of a talk or exposition. 

Here I present an alternative derivation of the linear noise approximation phrased entirely in SDE 
notation. Given the same scaling hypothesis regarding transition rates, we may make use of the chemical 
Langevin equation, or r-leaping, due to Gillespie [8, 9j. This relies on a timescale r at which there are many 
transitions per channel but over which the reaction rates do not change appreciably, so that the number of 
reactions per channel is approximately Gaussian over that timescale. The derivation proceeds 

1. Express the discrete process in terms of its transition rates. 

2. Given an appropriate timescale r, re-express the increment random variables as Gaussians, giving rise 
to an SDE. 

3. Make the ansatz K = Q,x + \/Ti^. 

4. Taylor expand the coefficients of the resulting equation about the deterministic term x. 

5. Collect terms to find a deterministic 0{^/0) equation for x and a Langevin equation for £, with linear 
terms of 0(1), and terms of higher order in 

The deterministic variables x we arrive at satisfy the same equations and so are identical. Using the Ito 
interpretation, which is reasonable for systems with internal noise |13j . the stochastic perturbation £ also 
corresponds in the two derivations to linear order. 



2 Derivation for chemical reaction systems 



Let K(i) be a vector of the numbers of different chemical species, subject to reaction channels K — > K + Vj 
each at rate firj(K/fi) 3 where the r,-'s are polynomials or other suitably nice functions. Note that we 
have built the system-size parameter into the reaction rates; discussions of why this is reasonable may be 
found in [5J [13] . Now suppose there is a timescale r at which the number of reactions per channel is large 
but the reaction rates rj do not change appreciably. Known as r-leaping, this is discussed extensively in 

[HI [TO] ; under our scaling hypothesis this occurs roughly when — — <c r c 



change in a quantity in time r. Since the number of reactions per channe 
and independent of the number in other channels, the evolution of K(t) is described by 

K(t + r) « K(t) + ^A/'(rOr j (K/0),rfir j (K/fi))vj 



, where A T indicates the 

A rrj 

is then approximately normal 



(1) 



= K(t) +r^^r j (K/fi)v j + ^(K/ft) Af (0, 1) v,- (2) 

j j 

which is the "chemical Langevin equation" described in [7J. 

Now we make the ansatz of K = J7x + vO£, a deterministic solution with a stochastic perturbation. 
Then ([2]), divided through by Q, becomes 

*(« + T ) + 4j«(« + r) 



*M + jgf + - E r, ■ (x(0 + ^{ ) v, + ^1 £ Jr, (x( t ) + ± ( )m (0, 1) Vj . (3) 
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As promised, we now Taylor expand the reaction rates so that 

r '( x + ^t f ) =^ + ^(0 + ^(0 + ... (4) 

where we denote by Dx r j(0 the mth directional derivative of rj, evaluated at x, in the direction of £. 
Substituting this in ([3]), we obtain 



x(t + r) + -={(( + t) 



1 1 



(x) + —D^Mt)) + „2^^W) + • • • 



ri (x) + ^Acr.-cew) + + 



AT(0,l) Vj . (5) 



Collecting terms of 0(1) in ([5]), we have a deterministic expression for the concentration term x 

x(i + r) =x(i)+r^r j (x)v i 

i 

which may be expressed as an ordinary differential equation 

dx(t) 



5Z r i( x ) v i 



(6) 



(7) 



which is the reaction rate equation. Collecting terms of O \ an d higher, we obtain a stochastic expression 
for the perturbation term £ whose coefficients are evaluated at the deterministic solution x(i), 

1 1 



r,-(x) + -Lz> xrj (e(t)) + Hz>2 ri (e(t)) + . . 



AT(0,l) Vi . (8) 



We may also write this equation in Langevin form, using the Ito interpretation 

dm 



dt 



E 



1 1 

H2" 



£> x r i (£(t)) + ^l£r i (f(t)) + ... 



1 1 



x) + — D x rj(£(t)) + ^DgrMt)) + 



W ( 9 ) 



where the rjj are independent white-noise variables, one for each reaction channel. It would be possible to 
replace these with one independent white noise per chemical species, calculating the coefficients from the 
last term in (|9l). 



Neglecting the terms in O an d higher, we reproduce the linear noise approximation 

dm 

dt 



(10) 
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This detailed derivation might still present a challenge to explain to a mixed audience; since the system size 
expansion is almost always truncated at the linear noise approximation, one could omit terms of order ^ 
and higher earlier on, and write down the linear noise approximation directly from ([3]). 

The derivation presented here comes from the chemical Langevin equation, which is formally equivalent 
to the Fokker-Planck equation, and so could not be used to make the full system size expansion, which is 
derived from the infinite-order Kramers-Moyal equation. 



3 Derivation for birth-death processes 

The derivation simplifies slightly for birth-death processes, i.e. discrete processes in which the only tran- 
sitions are K — > K ± e», where ej is the vector incrementing the ith population by 1. These can also be 
thought of as (inhomogenous) random walks on a square lattice. Our scaling assumption is incorporated by 
specifying the rate of the transition K — > K ± ej as £ltf(k/£l). 

Again suppose there is a timescale r at which the number of reactions per channel is large but the size- 



scaled reaction rates tj do not change appreciably; this occurs roughly when 



1 

fit. 



< T < 



tjT 



A T tj 



, where A T 



indicates the change in a quantity in time r. In this case the increment random variables are approximately 
normal, and the evolution of K(i) is described by 



K(t + r) K(t) + ^2Af (rnt+K/n), rSUf(K/n)) e t - ^ M (r^K/Oj.rOt: (K/fi)) e* 



K(t) + tQ £ [t+ (K/n) - t~ (K/fi)] e, + VrQ £ y/ [t+ (K/fi) + t~ (K/O)] N (0, 1) e« (12) 

i i 

K(t) + Ai(K/n)ei. + ^ v^ilP)^ (o. i) e * (is) 



Since the Gaussian increments for the increase and decrease of each population are independent, we have 
combined them into a single Gaussian for each population, and introduced the drift and diffusion vectors 
respectively, 



Ai(x)=t+(x)-t-(x), 
fli(x) = t+(x) + tr(x). 

Making the ansatz K = fix + V^C, we obtain equations 

x(i + r) =x(t) + A(x)r 
which translates to the ordinary differential equation 

dx(t) 



dt 



Afx) 



The stochastic perturbation £ evolves as 



£(t + r)=^)+r^ 



1 1 



(14) 
(15) 



(16) 



(17) 



B, + -1= (D^Bi) m) + ~ (^^) (^)) + . 



Af(0,l)ei (18) 
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where the coefficients are evaluated at the deterministic solution of (17), A, = Aj(x(i)) and Bi = Aj(x(i)) 



We may again write this equation in Langevin form, using the Ito interpretation. Let us pass directly to 
the Langevin form of the linear noise approximation, neglecting terms of order and introducing the 
Jacobian 



Jii = diA^t)) = di t|(x(t)) - tT(x(t)) 



The linear noise approximation is then 



dt 



(19) 



(20) 



where rji(t) is a vector of independent white noise variables, one for each population. Amongst the strengths 
of this derivation is to show clearly how the lack of correlations between the white noise inputs to the linear 



noise approximation (20) arises from the elementary transitions in a birth-death process incrementing only 



0: 



one population at a time. 

Another simplification is, if the deterministic solution (17) is at a fixed point, the drift term A 
since Aj(x) = (x) — i~(x) = 0, we have -Bi(x) = 2i^~(x) = 2i^( x )- This means one may choose how to 
express the noise amplitude Bi at a fixed point, according to whether tf or t~ has a simpler form. 



4 Discussion 



This note is intended to clarify a technical point in deriving a tractable continuous approximation for 
discrete stochastic processes. The linear noise approximation is more precise than the general weak noise 
approximation described in [5] since the system size supplies a physically-motivated small parameter. The 
derivation of the linear noise approximation presented here suggests also that the conditions for the chemical 
Langevin equation to hold are generally also needed for the linear noise approximation to hold. In particular, 
this requires a "physical" condition relating a timescale to the system size and transition rates. This is in 
contrast to some of the bolder promises made about the system size expansion: since the system size 
expansion goes to infinitely many orders in a small parameter, it appears that one may improve its accuracy 
by proceeding to higher orders in the expansion. However, this is never done; the author is aware of very 
few pieces of work that make use of higher orders in the system-size or Kramers- Moyal expansions [12J, and 
this work does not deal with an application. Given that uses of the system size expansion invariably stop 
at the linear noise approximation, one may as well derive it as directly and comprehensibly as possible. 
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